Real-Time Speech Recognition System

Authors

Hy Murveit

Mitch Weintraub

Abstract

PROJECT GOALS

SRI and U.C. Berkeley are developing hardware for a real-time implementation of spoken language systems (SLS). Our goal is to develop fast speech recognition algorithms and supporting hardware capable of recognizing continuous speech from a bigram- or trigram-based 10,000-word vocabulary or a 1,000- to 5,000-word SLS system.

RECENT RESULTS

The special-purpose system achieves its high computation rate by using special-purpose memories and data paths, and consists of the following components:

• A special-purpose HMM board with eight newly designed integrated circuits that performs the HMM inner-loop processing to implement the word-recognition algorithms.
• An output-distribution board made of off-the-shelf components for computing HMM discrete-density state-output probabilities.
• A multiprocessor TMS320C30 board for the statistical language processing. This board has a custom high-speed interface to the HMM board.
• A general-purpose CPU board to perform system control.
• A DSP board with an A/D converter for feature extraction.
• A Sun workstation for the spoken language system database retrieval and human-machine interface.

• Completed the construction of a working hardware prototype. This prototype has been demonstrated running the Resource Management (RM) task as well as the Airline Travel Information System (ATIS) task.
• Began intensive use of the hardware for a real-time Airline Travel Information System (ATIS) task.
• Completed the design and construction of a second-generation multiprocessor TMS320C30 grammar-processing board. Testing is currently in progress.
• Revised and corrected errors in several of the custom VLSI chips that are used for the HMM word-recognition processor.

• Complete the construction and testing of the second-generation multiprocessor TMS320C30 board with a high I/O bandwidth to interface with the special-purpose HMM board.
• Implement multiple types of grammars using this hardware.
• Collect data about man-machine speech interactions using the real-time hardware.
• Integrate the real-time recognizer into our research to shorten the development cycle for new systems.
• Evaluate the current architecture to determine the computational and algorithmic bottlenecks.
• Deliver a hardware prototype to DARPA.
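The abstract separates the computation of discrete-density state-output probabilities from the HMM inner loop. As a rough software illustration of that division of labor, the following is a minimal sketch of a Viterbi search over a discrete-density HMM, where the output probability of a frame is a table lookup indexed by a vector-quantization codeword. It is not the algorithm implemented in the SRI/Berkeley hardware; the function name, argument layout, and the toy two-state model are all assumptions made for illustration.

```python
import math

NEG_INF = float("-inf")


def viterbi_discrete(obs, log_init, log_trans, log_emit):
    """Best state path for a sequence of codeword indices (hypothetical helper).

    obs:       list of codeword indices, one per frame
    log_init:  log_init[s]     = log P(start in state s)
    log_trans: log_trans[p][s] = log P(next state s | current state p)
    log_emit:  log_emit[s][k]  = log P(codeword k | state s), a plain table lookup
    """
    n_states = len(log_init)
    # Best path score ending in each state after the first frame.
    score = [log_init[s] + log_emit[s][obs[0]] for s in range(n_states)]
    backptr = []
    for k in obs[1:]:
        new_score = [NEG_INF] * n_states
        choices = [0] * n_states
        for s in range(n_states):
            # Inner loop: choose the best predecessor state for s.
            for p in range(n_states):
                cand = score[p] + log_trans[p][s]
                if cand > new_score[s]:
                    new_score[s] = cand
                    choices[s] = p
            # Discrete-density output probability: one table lookup per state.
            new_score[s] += log_emit[s][k]
        score = new_score
        backptr.append(choices)
    # Trace the best path back from the best final state.
    best = max(range(n_states), key=lambda s: score[s])
    path = [best]
    for choices in reversed(backptr):
        path.append(choices[path[-1]])
    path.reverse()
    return score[best], path


# Toy two-state model with a three-codeword codebook (made-up numbers).
toy_init = [math.log(x) for x in (0.6, 0.4)]
toy_trans = [[math.log(x) for x in row] for row in ((0.7, 0.3), (0.4, 0.6))]
toy_emit = [[math.log(x) for x in row] for row in ((0.5, 0.4, 0.1), (0.1, 0.3, 0.6))]
toy_score, toy_path = viterbi_discrete([0, 1, 2], toy_init, toy_trans, toy_emit)
print(toy_path)  # [0, 0, 1]
```

The sketch works in the log domain so that long utterances do not underflow. The two nested loops over states are the part that a dedicated HMM board would accelerate, while the `log_emit` lookup corresponds loosely to the role of an output-distribution stage.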

Related resources

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of emotional states from speech in noisy conditions has become an important research topic in emotional speech recognition in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

Presentation of K Nearest Neighbor Gaussian Interpolation and comparing it with Fuzzy Interpolation in Speech Recognition

The Hidden Markov Model is a popular statistical method that is used in continuous and discrete speech recognition. The probability density function of observation vectors in each state is estimated with discrete-density or continuous-density modeling. The performance (in correct word recognition rate) of the continuous-density HMM is higher than that of the discrete-density HMM, but its computation complexity is very ...

Real Time Implementation of a License Plate Location Recognition System Based on Adaptive Morphology

License plate recognition (LPR) using morphology has the advantages of resistance to brightness changes, high-speed processing, and low complexity. However, these approaches are sensitive to the distance of the plate from the camera and to the imaging angle. Various assumptions reported in other works might be unrealistic and cause major problems in practice. In this paper we considered ...

Design and Implementation of a Real-Time System for Vehicle License Plate Detection and Recognition in Video Images

Automatic Number Plate Recognition (ANPR) is a popular topic in the field of image processing and has been studied from different aspects since the early 90s. There are many challenges in this field, including fast-moving vehicles, different viewing angles and different distances from the camera, complex and unpredictable backgrounds, poor-quality images, existence of multiple plates in the scene, va...

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making man-machine interaction more natural. Since speech is the most popular method of communication, recognizing human emotions from the speech signal has become a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

Publication date: 1991
